High density flash memory architecture with columnar substrate coding

ABSTRACT

A flash memory device includes a substrate having first and second wells. The first well is defined within the second well. A plurality of trenches defines the substrate into a plurality of sub-columnar active regions. The trenches are formed within the first well and extend into the second well. A plurality of flash memory cells are formed on each of the sub-columnar active regions.

CROSS-REFERENCES TO RELATED APPLICATIONS

[0001] The present application is related to and claims priority from U.S. Provisional Patent Application No. 60/362,348, filed on Mar. 5, 2002, which is incorporated by reference.

BACKGROUND OF THE INVENTION

[0002] Flash memory was originally developed as a derivative of Erasable Programmable Read Only Memory (EPROM). Conventional EPROM technology uses hot electron injection (also called avalanche injection) to program the memory and ultraviolet (UV) light to erase the contents of the memory. Avalanche injection of electrons into the floating gate is achieved by applying high positive voltage to both the drain and the control gate, and grounding the source. Exposing the cell to UV light increases the energy of the floating gate electrons to a level where they may jump the energy barrier between the floating gate and the oxide.

[0003] Conventional single-transistor cell flash memory technology is similar to single-transistor cell EPROM technology. However, flash memory allows for electrical erasure of the contents of the memory, either of the entire memory array at once or of a sector of the memory at once, by way of cold electron tunneling (also called Fowler-Nordheim tunneling).

[0004] An example of a conventional single-transistor cell for flash memory is illustrated in FIG. 1. Such a flash memory cell typically has thinner oxide under the floating gate (between the floating gate (106) and the channel) than an EPROM cell has. The thinner oxide allows for erasure to be achieved via cold electron tunneling between the floating gate (106) and the source (104).

[0005] Like programming of EPROM, programming of conventional single-transistor cell flash memory is typically performed by applying high positive voltage to both the drain (102) via the bitline and the control gate (108) via the wordline, while grounding the source (104). This causes hot electron injection from the substrate (101) near the drain (102) to the floating gate (106). This programming by way of hot electron injection is crude in that the charge stored in the floating gate (106) is difficult to control precisely. This inability to control precisely the charge stored in the floating gate (106) is a first disadvantage of conventional single-transistor cell flash memory. This disadvantage makes it difficult to store multi-levels (i.e. more than one bit of information) in a flash cell.

[0006] Erasure of conventional single-transistor cell flash memory may be performed by applying a high positive voltage (for example, plus 12 volts) to the substrate (101) and grounding the control gates (108) in a sector. This causes the tunneling of the electrons from the floating gates (106) to the sources (104). Portions of the memory smaller than a sector cannot be erased because the common substrate is shared by all cells in a sector. The size of a sector may be, for example, 512 kilobits of cells for a 4 megabit flash memory organized into 8 sectors. Thus, the inability to erase portions of the memory smaller than a sector is a second disadvantage of conventional single-transistor cell flash memory.
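The sector arithmetic in the example above can be checked with a short sketch. The function name and the convention that 1 megabit equals 1024 kilobits are assumptions made for illustration only.

```python
def sector_size_kilobits(total_megabits: float, num_sectors: int) -> float:
    """Return the size of one equally sized erase sector, in kilobits.

    Assumption for illustration: 1 megabit = 1024 kilobits.
    """
    if num_sectors <= 0:
        raise ValueError("num_sectors must be positive")
    return total_megabits * 1024 / num_sectors


# The example from the text: a 4 megabit array organized into 8 sectors.
print(sector_size_kilobits(4, 8))  # 512.0
```

Under these assumptions, the smallest erasable unit of the conventional device is the full 512 kilobit sector.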

[0007] As an alternative to using single-transistor cells, conventional flash memory may instead utilize cells with two or more transistors. For example, each cell may include two transistors: one being a select transistor; and the other being a storage transistor. Utilizing such multiple-transistor cells, erasure of portions as small as a single word has been achieved. However, such multiple-transistor cells are substantially larger than single-transistor cells, and hence are not suitable for high density flash memory applications.

[0008] Cell size in conventional flash memory is limited by cell punchthrough requirements. Cell punchthrough occurs when the depletion region of the drain junction merges with the depletion region of the source junction. In order to prevent cell punchthrough, a minimum distance is typically required between drain (102) and source (104) along a bitline (or column). The higher the maximum voltage applied to a drain during operation, the larger the minimum distance must be. In this way, cell punchthrough limits the size of cells along the columnar direction, and so is a third disadvantage of conventional single-transistor cell flash memory.

SUMMARY OF THE INVENTION

[0009] The present invention relates to flash memory technology.

[0010] In one embodiment, a flash memory device includes a substrate having first and second wells, the first well being defined within the second well. A plurality of trenches defines the substrate into a plurality of sub-columnar active regions. The trenches are formed within the first well and extend into the second well. A plurality of flash memory cells are formed on each of the sub-columnar active regions.

[0011] In another embodiment, a flash memory device includes a substrate having first and second wells. The first well is defined within the second well. A plurality of deep trenches define the substrate into a plurality of bitlines. The trenches are formed within the first well and extend into the second well. Each bitline includes a first region defined by the trenches whereon a plurality of flash memory cells are provided and a second region defined by the trenches whereon a plurality of flash memory cells are provided. The first and second regions are separated by the trenches. A connecting region has first and second pass transistors to regulate flow of currents to the first and second regions.

[0012] In yet another embodiment, a method of operating a non-volatile memory device includes programming a floating gate of a memory cell. The memory cell is erased by applying a first potential to a control gate of the memory cell and applying a second potential to a sub-columnar active region corresponding to the memory cell.

BRIEF DESCRIPTION OF THE DRAWINGS

[0013] FIG. 1 is a cross-sectional schematic diagram illustrating a conventional single-transistor flash memory cell.

[0014] FIG. 2 is a cross-sectional schematic diagram illustrating a conventional triple-well substrate configuration.

[0015] FIG. 3A is a cross-sectional schematic diagram illustrating a triple-well substrate configuration including deep trench isolation (DTI) in accordance with one embodiment of the present invention.

[0016] FIG. 3B is a cross-sectional schematic diagram illustrating a triple-well substrate configuration including shallow trench isolation (STI) in accordance with one embodiment of the present invention.

[0017] FIG. 3C is a top-view schematic layout illustrating trenches and active substrate regions in accordance with one embodiment of the present invention.

[0018] FIG. 4A is a cross-sectional schematic diagram illustrating a single-transistor flash memory cell along a bitline in accordance with one embodiment of the present invention.

[0019] FIG. 4B is a cross-sectional schematic diagram illustrating a single-transistor flash memory cell along a wordline in accordance with one embodiment of the present invention.

[0020] FIG. 5A is a first top-view schematic layout illustrating connection layers and contacts to flash memory cells in accordance with one embodiment of the present invention.

[0021] FIG. 5B is a second top-view schematic layout illustrating connection layers and contacts to flash memory cells in accordance with one embodiment of the present invention.

[0022] FIG. 5C is a third top-view schematic layout illustrating connection layers and contacts to flash memory cells in accordance with one embodiment of the present invention.

[0023] FIG. 6A is a top-view schematic layout illustrating trenches and active regions in accordance with another embodiment of the present invention.

[0024] FIG. 6B is a cross-sectional view of FIG. 6A taken along the arrows AA.

[0025] FIG. 6C is a top-view schematic layout illustrating an N+/P+ contact region.

DESCRIPTION OF THE SPECIFIC EMBODIMENTS

[0026] FIG. 1 is a cross-sectional schematic diagram illustrating a conventional single-transistor flash memory cell. The conventional cell is shown as being based on a triple p-well (101) which is the topmost well of a conventional triple well substrate configuration. Such a conventional triple well substrate configuration is illustrated in FIG. 2.

[0027] The conventional cell includes an n+ doped drain (102), an n+ doped source (104), a floating gate (FG) (106), a control gate (108), oxide (110), a drain contact (112), and a source contact (114). The drain voltage is represented by Vd, the source voltage is represented by Vs, and the control gate voltage is represented by Vg. Flash memory is conventionally configured such that the wordline [WL] is the control gate (108) and the bitline [BL] connects to the drain (102).

[0028] The conventional cell is typically programmed by hot electron injection from the channel (the substrate region between the drain (102) and the source (104)) to the floating gate (106). This is typically achieved by applying a high positive voltage to both the drain (102) and the control gate (108) while grounding the source (104). Electrons are thus injected into the floating gate (106). When the cell has been programmed, there is a negative charge on the floating gate due to the electrons collected thereon. The negative charge raises the threshold voltage of the device as seen from the control gate (108). Hence, with the application of a positive voltage on the control gate (108), the transistor will not turn on, as long as the control gate voltage is lower than the cell threshold voltage.

[0029] The conventional cell is typically erased by cold electron tunneling by applying a high positive voltage to the source (104) or to the Pwell (101) while grounding the control gate (108). This causes electrons from the floating gate (106) to tunnel to the source (104). When the cell has been erased, the collection of electrons on the floating gate (106) has been removed. This reduces the threshold voltage of the device. Hence, when a positive voltage is applied to the control gate (108), the transistor will turn on.

[0030] Note that FIG. 1 is for purposes of illustration and is not drawn to scale. For example, source and drain may be more asymmetrical than is shown in FIG. 1, with different junction depths. Similarly, the other figures in this application are also for purposes of illustration and are not drawn to scale.

[0031] FIG. 2 is a cross-sectional schematic diagram illustrating a conventional triple-well substrate configuration. The configuration includes a p doped substrate (202). Within the p doped substrate, an n doped well (204) is formed. Within the n doped well (204), a p doped well (101) is formed. It is on this Pwell (the triple Pwell) (101) that the cell shown in FIG. 1 is formed.

[0032] As discussed above, the present invention relates to isolating the substrate for each bitline (or column) of cells from the substrates of the other bitlines. This isolation is not feasible to achieve by creating separate Pwells during the Pwell implant because, at least for small bitline separations (for example, less than 0.5 micrometers given current technology), the separate Pwells would merge due to lateral diffusion. Hence, the present invention relates to using trenches to achieve the isolation. Two options for using trenches to achieve the isolation are illustrated in FIGS. 3A and 3B. This concept is also explained in U.S. Pat. No. 6,198,658, assigned to the present assignee, which is incorporated by reference for all purposes.

[0033] FIG. 3A is a cross-sectional schematic diagram illustrating a triple-well substrate configuration including deep trench isolation (DTI) in accordance with one embodiment of the present invention. In addition to the triple well substrate configuration of FIG. 2, the configuration of FIG. 3A includes a deep trench (302) which is filled with oxide. In one embodiment, the deep trench (302) extends to a depth somewhat below the bottom of the Pwell (101) in order to ensure isolation of active substrate regions (304) within the Pwell (101). To ensure isolation along the sidewalls of the trenches (302), the sidewalls may be implanted with p-implant. P-implant in the trench sidewalls increases the threshold (turn on) voltage for the trenches thereby ensuring that they remain in the “off” state and serve as isolation regions.

[0034] FIG. 3B is a cross-sectional schematic diagram illustrating a triple-well substrate configuration including shallow trench isolation (STI) in accordance with one embodiment of the present invention. Instead of using the deep trench isolation of FIG. 3A, the configuration of FIG. 3B uses a combination of a deep n+ implant (306) followed by a shallow trench (308) which is filled with oxide. It would be problematic to utilize n+ implant alone to fully isolate the active substrate regions (304) along the entire depth of the Pwell (101). So, instead, n+ implant (306) is used in the deep regions, and shallow trenches (308) filled with oxide are used in the shallow regions above the n+ implants.

[0035] Although for purposes of illustration only one active substrate region (304) is shown in FIGS. 3A and 3B, multiple active substrate regions (304) are formed in accordance with this invention. Each of the active substrate regions (304) is isolated from the other regions (304) by trenches (302 or 306+308), and each of the active substrate regions (304) comprises an isolated columnar substrate onto which a column of flash memory cells is formed.

[0036] FIG. 3C is a top-view schematic layout illustrating trenches (302 or 306+308) and multiple active substrate regions (304) in accordance with one embodiment of the present invention. The multiple active substrate regions (304) are shown separated by the trenches (302 or 306+308) in a columnar configuration, where each isolated active substrate region (304) corresponds to a column or bitline.

[0037] FIG. 4A is a cross-sectional schematic diagram illustrating a single-transistor flash memory cell along a bitline in accordance with one embodiment of the present invention. The cell of FIG. 4A differs from the cell of FIG. 1 in a few significant aspects. First, the cell of FIG. 4A is formed on an isolated columnar active substrate region (304) of the substrate while the cell of FIG. 1 is not. Second, an electrical connection (402) is made to the isolated columnar active substrate region (304). Third, because of the electrical contact (402) to the isolated region (304), erasure of the cell may be performed using cold electron tunneling from the floating gate (106) to the isolated columnar active substrate region (304). Hence, the electrical connection (402) may be used as a sort of second bitline [BL′] for purposes of program and erase, while the first bitline [BL] may be used for purposes of read only (and allowed to float during program and erase). Fourth, no source contact (114) is needed because local interconnect is used to connect to the sources (104).

[0038] FIG. 4B is a cross-sectional schematic diagram illustrating a single-transistor flash memory cell along a wordline in accordance with one embodiment of the present invention. As shown in FIG. 4B, the floating gate (106) is formed by a first polysilicon layer (poly1), and the control gate (108) is formed by a second polysilicon layer (poly2).

[0039] As discussed above, the present invention also relates to providing for connections to the flash cells, including separate access to each columnar active substrate region (304). Three options for providing for the connections are illustrated in FIGS. 5A, 5B, and 5C.

[0040] FIG. 5A is a first top-view schematic layout illustrating connection layers and contacts to flash memory cells in accordance with one embodiment of the present invention. The first layout includes floating gates (FG) (106), drain contacts (112), Pwell contacts (402), drain lines [BL] (502), source lines (504), control lines [WL] (506), and substrate lines [BL′] (508). Note that the layout dimensions are meant for purposes of illustration and are not meant to be exact. In addition, the Pwell contacts (402) need not be made for every cell or pair of cells, but only as often as needed to keep the columnar substrate resistance sufficiently low.

[0041] In the first layout, the floating gates (106) are shown as formed from a first polysilicon layer (poly1). The floating gates (106), of course, float in that they are surrounded by silicon dioxide and hence there are no direct connections to them. The drain lines (502) are shown as formed from a first metal layer (metal1) and are connected to the drains (102) via the drain contacts (112). Voltages on the drain lines (502) are represented by Vd. The source lines (504) are shown as formed from local interconnect (LI) and are connected to the sources (104). Because local interconnect is used, no source contact is needed to connect to each source (104). Voltages on the source lines (504) are represented by Vs. As shown in FIG. 5A, the source lines (504) comprise common source lines which are each shared by a pair of rows of cells. Local interconnect is a metal conductor capable of crossing isolation regions such as trenches. Local interconnect is also situated in a different level than regular metal (aluminum) or polysilicon, allowing it to cross those conductors. In one embodiment, the local interconnect comprises tungsten. The control lines (506) are shown as formed from a second polysilicon layer (poly2). Above each floating gate (106), a control gate (108) is formed by a control line (506). Voltages on the control lines (506) are represented by Vg. Finally, the substrate lines (508) are shown to be formed from a third polysilicon layer (poly3) and are connected to the isolated columnar active substrate regions (304) via the Pwell contacts (402). Voltages on the substrate lines (508) are represented by Vb.

[0042] The Pwell contacts (402) shown in FIG. 5A comprise in-array contacts to the isolated columnar active substrate regions (304). Such contacts (402) along the columns are necessary for cases where substrate resistance is sufficiently high to require them. Otherwise, if contacts are only made at the ends of the columnar active substrate regions (304), then the voltage drops due to the substrate resistance would substantially affect the substrate voltage (Vb) at cells towards the middle of the columns. An estimated cell size with the first layout using 0.25 micron fabrication technology would be 0.75 microns (along the row direction)×1.1 microns (along the column direction)=0.825 square microns.

[0043] FIG. 5B is a second top-view schematic layout illustrating connection layers and contacts to flash memory cells in accordance with one embodiment of the present invention. In comparison with the first layout in FIG. 5A, the second layout in FIG. 5B includes neither in-array Pwell contacts (402) nor substrate lines (508). In this case, contacts (not shown) are only made at the ends of the columnar active substrate regions (304) because substrate resistance is sufficiently low. The second bit lines [BL′] (not shown), which are utilized during program and erase, are connected to these end-of-column contacts. Without the in-array Pwell contacts (402), the second layout should allow for a smaller cell size than the first layout. An estimated cell size with the second layout using 0.25 micron fabrication technology would be 0.75 microns (along the row direction)×0.9 microns (along the column direction)=0.675 square microns.

[0044] FIG. 5C is a third top-view schematic layout illustrating connection layers and contacts to flash memory cells in accordance with one embodiment of the present invention. In comparison with the second layout in FIG. 5B, the third layout in FIG. 5C includes a different configuration for the source lines (504). Like the second layout, the third layout does not include in-array Pwell contacts (402) and hence has an estimated cell size of 0.675 square microns using 0.25 micron fabrication technology.
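The cell-size estimates quoted for the three layouts follow from multiplying the row pitch by the column pitch. A minimal sketch of that arithmetic is given here; the function and variable names are hypothetical, and only the pitch values come from the text.

```python
def cell_area_um2(row_pitch_um: float, column_pitch_um: float) -> float:
    """Return the cell area in square microns from its two pitches."""
    return row_pitch_um * column_pitch_um


# Pitches quoted in the text for 0.25 micron fabrication technology.
first_layout = cell_area_um2(0.75, 1.1)   # with in-array Pwell contacts
second_layout = cell_area_um2(0.75, 0.9)  # end-of-column contacts only

# Rounding hides binary floating-point noise in the products.
print(round(first_layout, 3))
print(round(second_layout, 3))

# Fraction of cell area saved by omitting the in-array contacts.
print(round(1 - second_layout / first_layout, 3))
```

On these figures, omitting the in-array Pwell contacts reduces the estimated cell area by roughly 18 percent.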

[0045] The n-diffusion opening areas drawn in FIG. 5C designate openings in the mask used for n-diffusion implantation. The diffusion regions will only be formed in the active regions, and therefore the sources (104) of cells located along the same wordline will not be connected.

[0046] In the connection scheme presented in FIG. 5C, the source (104) of a selected cell is permanently connected to the substrate (304). In a conventional scheme, a common source is typically grounded during program operation and either floating (for channel erase) or under positive voltage (for source erase) during erase operation. In the present scheme, the source (104) cannot be kept floating during erase operation, but this does not preclude erasing the cell. For programming operation, all sources are connected together, which can be realized by tying all LI lines together.

[0047] Table 1 shows an example of operational conditions for a flash memory cell in accordance with the present invention. The voltages shown in Table 1 are for purposes of illustration only and are meant to be neither exact nor limiting.

TABLE 1
Operational Conditions

Operation         Vg [WL]   Vd [BL]   Vb [BL′]   Vs
Program           9 V       float     −7 V       float
Erase             −8 V      float     9 V        float
Read              4 V       0.8 V     0 V        0 V
Program Inhibit   9 V       float     0 V        float

[0048] Note that while the source voltage Vs is shown as floating during certain operations in the above table, the source voltage Vs is actually allowed to float only under the layout schemes of FIGS. 5A and 5B. Under the layout scheme of FIG. 5C, the source voltage Vs would generally be equal to the substrate voltage Vb.
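The bias conditions of Table 1, together with the layout-dependent behavior of the source voltage just described, can be expressed as a small lookup. This is an illustrative sketch only: the names `TABLE_1`, `bias_conditions`, and `gate_to_substrate_v` are hypothetical, a floating node is represented by `None`, and the voltages are the illustrative values of Table 1 rather than exact operating points.

```python
from typing import Dict, Optional

FLOAT = None  # stands for a floating (undriven) node

# Illustrative voltages from Table 1, in volts.
TABLE_1: Dict[str, Dict[str, Optional[float]]] = {
    "program": {"Vg": 9.0, "Vd": FLOAT, "Vb": -7.0, "Vs": FLOAT},
    "erase": {"Vg": -8.0, "Vd": FLOAT, "Vb": 9.0, "Vs": FLOAT},
    "read": {"Vg": 4.0, "Vd": 0.8, "Vb": 0.0, "Vs": 0.0},
    "program_inhibit": {"Vg": 9.0, "Vd": FLOAT, "Vb": 0.0, "Vs": FLOAT},
}


def bias_conditions(operation: str, layout: str = "5A") -> Dict[str, Optional[float]]:
    """Return the Table 1 biases for one operation under a given layout.

    Under the layouts of FIGS. 5A and 5B the source may float. Under the
    layout of FIG. 5C the source is tied to the substrate, so Vs follows Vb.
    """
    if layout not in ("5A", "5B", "5C"):
        raise ValueError("layout must be '5A', '5B', or '5C'")
    bias = dict(TABLE_1[operation])  # copy so the table itself is not modified
    if layout == "5C":
        bias["Vs"] = bias["Vb"]
    return bias


def gate_to_substrate_v(operation: str) -> float:
    """Return Vg - Vb, the potential difference that drives tunneling."""
    bias = TABLE_1[operation]
    return bias["Vg"] - bias["Vb"]


print(bias_conditions("erase", layout="5A"))
print(bias_conditions("erase", layout="5C"))
print(gate_to_substrate_v("program"), gate_to_substrate_v("erase"))
```

With the Table 1 values, the control gate sits 16 V above the columnar substrate during programming and 17 V below it during erase, while the inhibited column sees only 9 V.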

[0049] Table 2 shows another example of operational conditions for a flash memory cell in accordance with the present invention. The voltages shown in Table 2 are for purposes of illustration only and are meant to be neither exact nor limiting.

TABLE 2
Operational Conditions

Operation         Vg [WL]   Vd [BL]   Vb [BL′]   Vs
Program           9 V       float     −9 V       float
Erase             −8 V      float     8 V        float
Read              4 V       0.8 V     0 V        0 V
Program Inhibit   9 V       float     0 V        float

[0050] Table 3 shows yet another example of operational conditions for a flash memory cell in accordance with the present invention. The voltages shown in Table 3 are for purposes of illustration only and are meant to be neither exact nor limiting.

TABLE 3
Operational Conditions

Operation         Vg [WL]       Vd [BL]        Vb [BL′]      Vs
Program           6 to 9 V      float          −6 to −9 V    float
Erase             −5 to −8 V    float          5 to 8 V      float
Read              3 to 4 V      0.5 to 0.8 V   0 V           0 V
Program Inhibit   6 to 9 V      float          0 V           float

[0051] In one embodiment, the programming operation is performed by injecting hot electrons into the floating gate from a source side of a cell. For programming cells to “0”, a selected word line is provided with about 8 volts, a non-selected word line is floated or grounded, a selected bit line is grounded, a non-selected bit line is floated, and a common source is provided with about 4 volts.

[0052] For programming cells to “1”, a selected word line is provided with about −8 volts, a non-selected word line is floated or grounded, a selected bit line is provided with about 4 volts, a non-selected bit line is floated, and a common source is provided with about 4 volts.

[0053] For the source programming methods above, the source region is a double-diffused junction including phosphorus and arsenic. Phosphorus is implanted and diffused to a concentration of about 3×10¹⁴ cm⁻², using about 50 keV. Arsenic is implanted and diffused to a concentration of about 4×10 cm⁻, using about 60 keV. The source region preferably is formed deeper into the substrate than the drain region and has a more abrupt profile with respect to the surface of the substrate than the drain region. The source side programming enables the cells to be programmed faster than the drain side programming.

[0054] The present invention is advantageous in several aspects. First, flash memory in accordance with the present invention is well suited for multi-level storage operation in which more than 1 bit of information is stored per cell. The independent access to each of the columnar regions (304) provides a capability for achieving more precise control over the voltages stored on the floating gates (106). More precise control over the stored voltages is required for multi-level storage. Since each bitline may be accessed individually during erase, as well as during programming, a capability is provided to correct for any “overshoot” during the programming of a bit. Programming algorithms may use this capability in order to achieve the storage of more precise voltage levels which is needed for multi-level storage. In addition, additional precision in the control of the stored voltage levels is provided by the present invention because both programming and erase utilize a tunneling mechanism, instead of an injection mechanism.
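The overshoot correction described above can be pictured as a program-and-verify loop in which an erase pulse on the same bitline pulls back a cell that was programmed too far. The following is a toy sketch under stated assumptions, not the algorithm of the invention: the function name, the fixed step sizes, the tolerance window, and the use of millivolt integers are all hypothetical, and a real cell does not respond to pulses in fixed increments.

```python
from typing import List, Tuple


def program_to_level(
    start_mv: int,
    target_mv: int,
    tolerance_mv: int = 60,
    program_step_mv: int = 300,
    erase_step_mv: int = 100,
    max_pulses: int = 100,
) -> Tuple[int, List[str]]:
    """Drive a toy threshold voltage into target +/- tolerance.

    All numbers are hypothetical. A "program" pulse raises the threshold by
    program_step_mv; an "erase" pulse on the same bitline lowers it by
    erase_step_mv, which is how overshoot is corrected in this model.
    """
    if erase_step_mv > 2 * tolerance_mv:
        raise ValueError("erase step could jump over the target window")
    vt = start_mv
    pulses: List[str] = []
    while True:
        if vt < target_mv - tolerance_mv:
            kind, delta = "program", program_step_mv
        elif vt > target_mv + tolerance_mv:
            kind, delta = "erase", -erase_step_mv
        else:
            return vt, pulses  # verified inside the target window
        if len(pulses) >= max_pulses:
            raise RuntimeError("did not converge within max_pulses")
        vt += delta
        pulses.append(kind)


final_mv, history = program_to_level(start_mv=1000, target_mv=2000)
print(final_mv, history)
```

In this run the fourth program pulse overshoots the window, and two finer erase pulses bring the level back to the target. A multi-level scheme would repeat the same loop with a different target for each stored level.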

[0055] Second, the independent access to each of these columnar regions (304) provides a capability for areas of flash memory smaller than an entire sector to be erased at one time. As few as one or two wordlines (also called rows) may be erased at one time by applying a high negative voltage to control gates (108) of the selected wordline, a small positive voltage to the control gates (108) of the unselected wordlines, and a high positive voltage to the active substrate regions (304), while allowing the sources (104) and drains (102) to float. In this case, the n-polarity source line (504) corresponding to the selected wordline (108) floats to a level near to the positive voltage of the active substrate regions (304). Hence, if each wordline has its own source line (504), then one wordline at a time may be so erased. Alternatively, if two neighboring wordlines share a common source line (504), then two neighboring wordlines at a time may be so erased.
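The wordline-selective erase described above can be illustrated with a simple threshold model: a row is treated as erased only when its control gates sit far enough below the substrate voltage. This is a sketch under loud assumptions. The 15 V threshold, the 2 V value used for the "small positive voltage", and the function name are hypothetical; actual tunneling depends on oxide thickness and pulse duration rather than on a sharp cutoff.

```python
from typing import List


def erased_wordlines(
    vg_by_wordline: List[float],
    vb: float,
    erase_threshold_v: float = 15.0,
) -> List[int]:
    """Return indices of wordlines whose cells would erase in this toy model.

    A wordline counts as erased when the substrate is at least
    erase_threshold_v above the control gate voltage (hypothetical cutoff).
    """
    return [
        index
        for index, vg in enumerate(vg_by_wordline)
        if vb - vg >= erase_threshold_v
    ]


# One selected wordline at -8 V, unselected wordlines at an assumed +2 V,
# and the active substrate regions at +9 V.
print(erased_wordlines([2.0, -8.0, 2.0, 2.0], vb=9.0))

# Two neighboring wordlines sharing a common source line are selected together.
print(erased_wordlines([2.0, -8.0, -8.0, 2.0], vb=9.0))
```

The small positive voltage on the unselected wordlines reduces the potential difference across their cells, which is why they stay below the assumed threshold.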

[0056] Moreover, an area even smaller than one or two wordlines may be erased. A single cell may be erased by applying a high negative voltage to the control gates (108) in a selected wordline, grounding the columnar active substrate region (304) corresponding to a particular bitline, and allowing the drains (102) and sources (104) to float. In this case, no voltage is induced in the floating source line (504). Hence, a single cell may be erased.

[0057] Finally, since both programming and erasing are achieved by way of cold electron tunneling from the columnar active substrate region (304), no high voltages need to be applied to either the drain (102) or source (104). As can be seen from Table 1, the highest voltage applied between drain and source occurs during the read operation and is less than one volt. This reduces the minimum distance required by cell punchthrough; hence, the size of the cell may be reduced along the columnar direction. As a result, higher densities of flash memory may be achieved.

[0058] In another embodiment, the columnar active substrate region is divided into a plurality of sub-columnar active regions to increase the speed of the erase and program operations. The smaller structural area of the sub-columnar active regions provides faster charge and discharge times than the larger columnar active substrate region. Although the examples provided below illustrate a columnar active substrate region being divided into two sub-columnar active regions, it may be divided into three or more sub-columnar active regions in other embodiments.

[0059] FIG. 6A shows a top-view schematic layout of a flash memory device (600) including deep trenches (602, 604, 606, and 608), a plurality of sub-columnar active regions (610 and 612), and a plurality of connecting regions (614). Each connecting region (614) electrically couples the upper sub-columnar active region (610) and the lower sub-columnar active region (612), as explained in more detail later.

[0060] In one implementation, the active regions (610 and 612) are P wells and the connecting regions (614) are N wells. In the same or different implementation, a plurality of flash memory cells are provided in a NAND-gate structure on the sub-columnar active regions. The NAND-gate structure generally occupies significantly less area than a NOR-gate structure for comparable storage capacity.

[0061] FIG. 6B shows a schematic cross-sectional view of the device (600) taken along the arrows AA. Pass transistors (616 and 618), separated by a contact well (620), are formed on the connecting region (614). The contact well (620) is a P well coupled to a second metal layer M2 or the global bitline. The global bitline is coupled to and applies voltages to a plurality of bitlines. A diffusion region (622) of the transistor (616) is coupled to a diffusion region (624) of the sub-columnar active region (610). The diffusion regions (622 and 624) are coupled to a first metal layer M1 or local interconnect that provides voltages to the sub-columnar active region (610). The active region (610) includes a plurality of flash memory cells formed thereon in a NAND gate structure (not shown).

[0062] Similarly, a diffusion region (626) of the transistor (618) is coupled to a diffusion region (628) of the sub-columnar active region (612). The diffusion regions (626 and 628) are coupled to a first metal layer M1 that provides voltages to the sub-columnar active region (612). The active region (612) includes a plurality of flash memory cells formed thereon in a NAND gate structure (not shown).

[0063] In operation, the pass transistors are used to regulate the current flow to the sub-columnar active regions. For example, the transistor (616) is turned on to apply a particular voltage to the sub-columnar active region (610) or turned off to cut off the voltage supply. Similarly, the transistor (618) is turned on or off to regulate voltages being applied to the sub-columnar active region (612). The voltages being applied to the sub-columnar active regions correspond to the voltages supplied by the second metal layer M2 via the contact well (620).
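The gating role of the two pass transistors can be sketched as a pair of ideal switches between the global bitline and the two sub-columnar active regions. The model below is an assumption made for illustration: the function name is hypothetical, the transistors are treated as lossless switches, and a region whose pass transistor is off is represented by `None` (cut off from the supply).

```python
from typing import Dict, Optional


def sub_column_voltages(
    global_bitline_v: float,
    pass_616_on: bool,
    pass_618_on: bool,
) -> Dict[int, Optional[float]]:
    """Return the voltage delivered to regions 610 and 612.

    Keys are the reference numerals of the sub-columnar active regions.
    Idealized model: an "on" pass transistor passes the global bitline
    voltage unchanged; an "off" pass transistor cuts off the supply (None).
    """
    return {
        610: global_bitline_v if pass_616_on else None,
        612: global_bitline_v if pass_618_on else None,
    }


# Apply 9 V from the global bitline to the upper region only.
print(sub_column_voltages(9.0, pass_616_on=True, pass_618_on=False))
```

Because only the selected sub-columnar region is connected to the global bitline, only that smaller region needs to be charged or discharged for a given operation.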

[0064] In one implementation, the connecting region (614) is an N well, and the sub-columnar active regions (610 and 612) are P wells.

[0065] In one implementation, the diffusion regions (624 and 628) of the sub-columnar active regions (610 and 612) are N+/P+ regions, so that a single N well may be used for two adjacent flash memory cells. FIG. 6C illustrates an example of one such N+/P+ region (630).

[0066] Having fully described the present invention using the specific embodiments disclosed above, other equivalent or alternative methods of practicing the present invention will be apparent to those skilled in the art. Accordingly, the present invention is not intended to be limited by the specific embodiments and implementations provided above. The scope of the present invention, therefore, should be interpreted using the appended claims. 

What is claimed is:
 1. A flash memory device, comprising: a substrate having first and second wells, the first well being defined within the second well; a plurality of trenches defining the substrate into a plurality of sub-columnar active regions, the trenches being formed within the first well and extending into the second well; and a plurality of flash memory cells formed on each of the sub-columnar active regions.
 2. The memory device of claim 1, wherein the flash memory cells are arranged in a NAND gate structure.
 3. The memory device of claim 1, wherein the trenches comprise deep trenches filled with oxide.
 4. The memory device of claim 1, wherein the trenches comprise deep ion implants below shallow trenches filled with oxide.
 5. The memory device of claim 1, further comprising: a connecting region defined by the plurality of trenches, the connecting region being configured to electrically couple two sub-columnar active regions aligned to each other along an axis.
 6. The memory device of claim 5, wherein the connecting region includes at least two pass transistors to regulate flow of currents to the two sub-columnar active regions.
 7. The memory device of claim 1, wherein connections to sources of the flash memory cells are formed using local interconnect.
 8. The memory device of claim 1, wherein individual access to bitlines during erase is provided by way of the sub-columnar active regions.
 9. The memory device of claim 8, wherein during programming an algorithm uses the individual access to bitlines to correct for overshoot.
 10. The memory device of claim 9, wherein more than one bit of data is stored per memory cell.
 11. The memory device of claim 1, wherein an area smaller than an entire sector is erased at one time.
 12. The memory device of claim 1, wherein the first well is a P well and the second well is an N well.
 13. The memory device of claim 1, wherein the sub-columnar active regions provide faster erase and program operations than columnar active substrate regions.
 14. A flash memory device, comprising: a substrate having first and second wells, the first well being defined within the second well; a plurality of deep trenches defining the substrate into a plurality of bitlines, the trenches being formed within the first well and extending into the second well, each bitline including: a first region defined by the trenches whereon a plurality of flash memory cells are provided, and a second region defined by the trenches whereon a plurality of flash memory cells are provided, the first and second regions being separated by the trenches; a connecting region having first and second pass transistors to regulate flow of currents to the first and second regions.
 15. The device of claim 14, wherein the first and second pass transistors are coupled to a global bitline.
 16. The device of claim 15, wherein the first and second pass transistors share a common diffusion region that is coupled to the global bitline.
 17. The device of claim 14, wherein each bitline includes the connecting region.
 18. The device of claim 14, wherein the first and second regions include N+/P+ contact regions that are coupled to the first and second pass transistors.
 19. A method for programming a non-volatile memory device, comprising: applying a first voltage to a selected control gate; and applying a second voltage to a source, the second voltage being a positive potential.
 20. The method of claim 19, wherein the first voltage is about 8 volts in absolute term and the second voltage is about 4 volts, the method further comprising: grounding a selected bit line if the programming operation involves programming a cell to logic “0”; and applying the selected bit line with a third voltage if the programming operation involves programming the cell to logic “1”, wherein the second voltage is substantially the same as the third voltage in magnitude. 